Developing Additive Spectral Approach to Fuzzy Clustering

نویسندگان

  • Boris G. Mirkin
  • Susana Nascimento
چکیده

An additive spectral method for fuzzy clustering is presented. The method operates on a clustering model which is an extension of the spectral decomposition of a square matrix. The computation proceeds by extracting clusters one by one, which allows us to draw several stopping rules to the procedure. We experimentally test the performance of our method and show its competitiveness. In spite of the fact that many relational fuzzy clustering algorithms have been developed already [1,2,3,4,12], most of them are ad hoc and, moreover, they all involve manually specified parameters such as the number of clusters or threshold of similarity without providing any guidance for choosing them. We apply a model-based approach of additive clustering, combined with the spectral clustering approach, to develop a novel relational fuzzy clustering method that is both adequate and supplied with model-based parameters helping to choose the right number of clusters. We assume the data in the format of what is called similarity or relational data, that is a matrix W = (wtt′), t, t′ ∈ T , of similarity indexes wtt′ , between objects t, t′ from a set of objects T . We further assume that this similarity values are but manifested expressions of some hidden relational patterns which can be represented by fuzzy clusters. We propose to formalize a relational fuzzy cluster as represented by two items: (i) a membership vector u = (ut), t ∈ T , such that 0 ≤ ut ≤ 1 for all t ∈ T , and (ii) an intensity μ > 0 that expresses the extent of significance of the pattern corresponding to the cluster. With the introduction of the intensity, applied as a scaling factor to u, it is the product μu that is a solution rather than its individual co-factors. Given a value of the product μut, it is impossible to tell which part of it is μ and which ut. To resolve this, we follow a conventional scheme: let us constrain the scale of the membership vector u on a constant level, for example, by a condition such as ∑ t ut = 1 or ∑ t u 2 t = 1, then the remaining factor will define the value of μ. The latter normalization better suits the criterion implied by our fuzzy clustering method and, thus, is accepted further on. S.O. Kuznetsov et al. (Eds.): RSFDGrC 2011, LNAI 6743, pp. 273–277, 2011. c © Springer-Verlag Berlin Heidelberg 2011 274 B. Mirkin and S. Nascimento To make the cluster structure in the similarity matrix sharper, we apply the spectral clustering approach to pre-process a raw similarity matrix W into A by using the so-called normalized Laplacian transformation as related to the popular clustering criterion of normalized cut [6]. This criterion relates to the minimum non-zero eigenvalue of the Laplacian matrix. To change this to the maximum eigenvalue, we further transform this to its pseudo-inverse matrix, which also increases the gaps between eigenvalues. Our additive fuzzy clustering model follows that of [11,7,10] and involves K fuzzy clusters that reproduce the pseudo-inverted Laplacian similarities att′ up to additive errors according to the following equations:

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Thematic Fuzzy Clusters with an Additive Spectral Approach

This paper introduces an additive fuzzy clustering model for similarity data as oriented towards representation and visualization of activities of research organizations in a hierarchical taxonomy of the field. We propose a one-by-one cluster extracting strategy which leads to a version of spectral clustering approach for similarity data. The derived fuzzy clustering method, FADDIS, is experime...

متن کامل

Laplacian normalization for deriving thematic fuzzy clusters with an additive spectral approach

This paper presents a further investigation into computational properties of a novel fuzzy additive spectral clustering method, FADDIS, recently introduced by authors (Mirkin and Nascimento 2012). Specifically, we extend our analysis to “difficult” data structures from the recent literature and develop two synthetic data generators simulating affinity data of Gaussian clusters and genuine addit...

متن کامل

Additive spectral method for fuzzy cluster analysis of similarity data including community structure and affinity matrices

An additive spectral method for fuzzy clustering is proposed. The method operates on a clustering model which is an extension of the spectral decomposition of a square matrix. The computation proceeds by extracting clusters one by one which makes the spectral approach quite natural. The iterative extraction of clusters, also, allows us to draw several stopping rules to the procedure. This appli...

متن کامل

Constructing and Mapping Fuzzy Thematic Clusters to Higher Ranks in a Taxonomy

We present a method for mapping a structure to a related taxonomy in a thematically consistent way. The components of the structure are supplied with fuzzy profiles over the taxonomy. These are then generalized in two steps: first, by fuzzy clustering, and then by mapping the clusters to higher ranks of the taxonomy. To be specific, we concentrate on the Computer Sciences area represented by th...

متن کامل

Using fuzzy c-means clustering algorithm for common lecturer timetabling among departments

University course timetabling problem is one of the hard problems and it must be done for each term frequently which is an exhausting and time consuming task. The main technique in the presented approach is focused on developing and making the process of timetabling common lecturers among different departments of a university scalable. The aim of this paper is to improve the satisfaction of com...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011